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ARO  PHASE  I  SBm  CONTRACT  NO.  DAAL03-92-C-0004 
"Development  of  Rules  for  Folding  of  Biotechnology  Produced  Protein" 

RESULTS  OF  PHASE  I  WORK 

1.  Scientiflc  Basis  for  Phase  I 

From  the  known  crystal  structures  of  globular  proteins,  it  is  apparent  that  proteins  fold  with  their 
hydrophobic  residues  in  the  interior  of  the  protein.  So  apparent  is  this  that  hydrophobicity  scales  have 
been  developed  based  on  the  frequency  at  which  specific  amino  acid  residues  are  found  buried  within  the 
globular  protein  or  on  the  surface  of  the  protein.  Such  a  scale,  however,  does  not  discern  interactions 
between  protein  moieties  that  may  alter  expression  of  hydrophobicity,  nor  of  course  could  it  evaluate  the 
nature  and  magnitude  of  the  interactions  responsible  for  the  distribution  and  the  protein  folding  principles 
that  give  rise  to  it. 

In  order  to  provide  an  experimental  basis  for  developing  rules  governing  the  folding  resulting  from 
intramolecular  hydrophobic  interactions  and  hydrophobicity-enhanced  charge-charge  interactions,  a 
polypeptide  system  is  required  with  the  correct  balance  of  hydrophobic  (apolar)  and  polar  moieties 
resulting  in  a  readily  perturbable  transition  temperature  for  folding,  with  a  readily  measurable  and  near 
optimized  response  when  compared  to  other  polypeptides  and  proteins,  and  with  low  energy  barriers  for 
conformational  interconversion  such  that  reversibility  can  be  maintained  for  the  many  applied 
perturbations. 

The  Model  Molecular  System 

Nature  has  provided  such  a  polypeptide  system  within  the  second  most  abundant  extracellular 
protein,  elastin.  The  polypeptide  system  to  be  used  is  the  most  striking  repeating  peptide  sequence  known 
in  mammalian  elastic  fibers;  it  is  well-represented  in  bovine  and  porcine  elastins  where  in  the  former  the 
sequence,  (Val^-Pro2-Gly3-Val'*-Gly5)ii,  occurs  without  a  single  substitution  for  a  sequence  of  n  =  1 1  (1- 
5).  PolyCVPGVG)  has  the  correct  balance  of  hydrophobic  and  polar  residues  such  that  it  is  miscible  with 
water  in  all  proportions  below  25°C  but,  as  the  temperature  is  raised  through  the  transitional  range  to 
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37°C,  it  aggregates  into  a  condensed,  more-dense  state,  that  is,  it  exhibits  a  phase  transition  from  a  state  I 
(the  solution  state)  to  a  state  n  (the  condensed  phase  called  a  coacervate)  which  is  in  equilibrium  with  the 
overlying  solution  (6). 

The  Inverse  Temperature  Transition 

When  high  molecular  weight  poly(VPGVG)  is  cross-linked  as  for  example  by  20  Mrads  of  y- 
irradiation,  i.e.,  X20  poly(VPGVG),  on  lowering  the  temperature  below  37°C,  it  swells  to  extents  allowed 
by  the  cross-linking  and  chain  length,  and  it  contracts  with  expulsion  of  water  on  raising  the  temperature 
from  20°C  to  40°C  with  changes  in  length  of  greater  than  a  facto-  of  2  (6,7).  Generally  molecular  systems 
(a  common  example  would  be  molecular  crystals  in  equilibrium  with  a  mother  liquor)  become  more 
soluble  and  less-ordered  on  raising  the  temperature,  whereas  poly(VPGVG)  becomes  less  soluble  and 
more-ordered  on  raising  the  temperature  through  the  range  of  the  transition.  Because  of  this,  the 
phenomenon  exhibited  by  poly(VPGVG)  is  called  an  inverse  temperature  transition.  A  naturally  occurring 
analog  of  VPGVG,  with  an  alanine  between  the  valine  and  proline,  when  repeated  and  cyclized,  i.e., 
cyclo(VAPGVG)2,  provides  the  striking  example  of  a  molecular  system  capable  of  exhibiting  an  inverse 
temperature  transition;  when  having  been  dissolved  in  water  at  low  temperature,  it  reversibly  crystallizes 
on  raising  the  temperature  and  dissolves  on  lowering  the  temperature  (8).  An  understanding  of  the 
molecular  basis  for  such  inverse  temperature  transitions  necessarily  resides  in  the  nature  of  the  hydration 
shell  that  surrounds  hydrophobic  (apolar)  moieties. 

Hydrophobic  Hydration  as  Basis  for  the  Inverse  Temperature  Transition 

Dissolution  of  hydrophobic  moieties  in  water  is  an  exothermic  process,  that  is,  AH  =  AE  +  PAV  is 
negative  and  heat  is  released,  but  the  solubility,  which  depends  on  a  negative  Gibbs  free  energy  of 
hydration  (AG[hydration]  =  AH  -  TAS),  of  hydrophobic  moieties  is  generally  low  because  of  a  significant 
negative  entropy  change  (AS)  as  water  molecules  in  bulk  solution  become  more-ordered  in  the  shell  of 
water  surrounding  the  hydrophobic  moieties.  Examples  of  the  nature  of  the  ordered  water  surrounding 
hydrophobic  groups  can  be  found  in  the  crystal  structures  of  alkane  gas  hydrates  (9).  Therefore,  even 
though  raising  the  tempmture  of  poly(VPGVG)  dissolved  in  water  can  cause  it  to  fold  and  assemble  into  a 
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lower  entr(^y  condensed  phase,  the  more-ordered  waters  of  hydrophobic  hydration  surrounding  the 
hydrophobic  moieties  in  its  dissolved  state  become  less-ord^^  bulk  wato*  as  the  polypeptide  aggregates. 
The  net  result  on  raising  the  temperature  through  the  transition  is  an  increase  in  disorder,  i.e.,  an  increase 
in  entropy,  for  the  system  as  a  whole  in  keeping  with  the  second  law  of  thermodynamics  (10-12),  but 
irqxxtantly  there  has  occurred  a  very  useful  increase  in  (»der,  decrease  in  entrq)y,  for  the  polypeptide  part 
of  the  system  as  has  been  clearly  demonstrated  by  studies  on  poly(VPGVG)  and  its  analogs  (13). 
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Figure  1 

Space  niling  models  in  stereo  per¬ 
spective  (for  cross-eye  viewing)  of 
the  folded  p-spiral  structure  of 
polyCVPGVG)  and  of  an  extended 
structure  where  all  of  the 
hydrophobic  groups  would  be 
exposed  to  solvent  and  surrounded 
by  waters  of  hydrophobic 
hydration. 


State  II 


State  I 


The  proposed  molecular  stracture  fw  polyCVPGVG)  is  shown  in  Figure  1 A  where,  in  the  helically 
folded  state  n,  there  are  hydrophobic  contacts  between  turns  of  the  particular  helical  structure  called  a  P- 
spiral.  It  is  the  optimization  of  intertum  hydrophobic  contacts  coupled  as  it  is  with  the  change  of  waters  of 
hydrophobic  hydraticm  to  bulk  water  that  is  the  driving  fence  for  folding  (6,7).  Figure  IB  is  the  extended 
state  for  which  there  are  no  intertum  hydrophobic  ctMtacts  and  in  which  the  hydrophobic  groups  would  be 
surrounded  by  waters  of  hydrophobic  hydration. 
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ATt  Mechanism  of  Protein  Folding  and  Free  Energy  Transduction 


Thermomechanical  Transduction:  In  the  unfolded,  low  temperature  state  (state  I),  cross-linked 
poly(VPGVG)  is  a  hydrogel  or  a  superabsorbent  which,  when  the  temperature  is  raised,  expels  substantial 
amoimts  of  the  water  to  become  the  folded,  the  more-dense,  condensed  phase  (state  11).  Principals  in  this 
SBIR  application  have  demonstrated  a  number  of  ways  whereby  the  temperature  of  the  folding  transition 
can  be  shifted.  This  means  that  it  becomes  possible,  without  a  change  in  temperature,  to  shift  from  state  I 
to  state  n  or  vice  versa.  Since  the  state  I  to  state  n  folding  transition  can  be  used  to  carry  out  work,  the 
cross-linked  matrix  contracts  on  raising  the  temperature  and  can  lift  weights  that  are  a  thousand  times 
greater  than  the  dry  weight  of  the  cross-linked  matrix.  Thus,  for  a  cross-linked  matrix  the  inverse 
temperature  transition  itself  is  seen  as  a  thermally  driven  contraction,  that  is,  X20.poly(VPGVG)  exhibits 
thermomechanical  transduction  (14-16). 

Chemomechanical  Transduction:  Now,  if  an  analog  of  X20-poly(VPGVG)  should  be  unfolded  at 
physiological  temperatures,  for  example,  and  a  chemical  concentration  change  causes  a  lowering  of  the 
transition  ten^erature  to  below  37°C,  then  the  chemical  concentration  change  will  drive  a  contraction  as 
the  system  shifts  to  the  folded  state.  This  is  chemomechanical  transduction,  and  it  has  been  achieved  by 
two  different  general  chemical  processes,  a  polymer-based  process  (16)  and  a  solvent-based  process  (17). 

The  polymer-based  process  utilizes  functional  side  chains  in,  for  example, 
poly[4(VPGVG),(VPGEG)]  where  E  =  Glu.  A  protonation  (due  to  a  decrease  in  pH),  causes  a  shift  from 
the  less  hydrophobic  state  (i.e.,  ftom  the  more  polar  CXX7”  moieties)  to  the  more  hydrophobic  state  (i.e., 
to  the  less  polar  COOH  moieties).  This  is  because  the  temperatiu^  of  the  transition  is  lowered  as  the 
polypeptide  becomes  more  hydrophobic,  and  it  is  raised  as  the  polypeptide  becomes  less  hydrophobic 
(16).  The  nx>lecular  interpretation  is  that  the  pull  of  the  COO~ moiety  to  achieve  its  waters  of  hydration 
destructures  the  waters  of  hydrophobic  hydration,  thereby  rennoving  the  thermodynamic  driving  force  for 
folding  (18).  This  interpretation  derives  from  the  observed  hydrophobicity  induced  increase  in  pKa  of 
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constituent  Glu  residues  on  increasing  hydrophobicity  by  conversion  of  Val^  to  Ile^  as  in 
poly[4(IPGVG),(IPGEG)]  ot  in  the  increase  in  pKg  of  Glu  residues  on  stretching  to  increase  exposure  of 
hydrophobic  groups  (19,20).  Even  though  in  the  latter  case  there  is  an  increase  in  the  amount  of  water  in 
the  cross-linked  matrix  on  stretching,  the  free  energy  of  COCr~  moieties  is  increased  causing  a  delay  in 
ionization  on  lowering  the  proton  concentration.  This  defines  the  mechanism  as  (d)X/d/)„^  <  0  for  the 

use  of  the  CXX)H/CXXII~  chemical  couple  to  achieve  chenoomechanical  transducticm  by  modulation  of  the 
temperature  of  an  inverse  temperature  transition.  In  the  partial  derivative,  /  is  force;  p.  is  chemical 
potential,  and  n  is  constant  composition  which  in  this  case  is  a  constant  degree  of  ionization,  a  (20). 

Significantly,  the  sign  is  reversed  for  the  same  chemical  couple  for  the  charge-charge  repulsion  mechanism 
where  (3p/9/)n  >  0  as  shown  by  Katchalsky  and  colleagues  (21).  Importantly  for  directing  protein 

folding  the  expression  of  hydrophobicity  depends  on  whether  a  proximal  Glu  residue  has  its  side  chain  as 
CXXDH  or  as  CXXJ". 

In  the  solvent-based  process,  changes  in  composition  of  the  solvent  can  change  the  temperature  of 
the  transition.  For  example,  increasing  the  concentration  of  salt  (NaCl)  lowers  the  transition  temperature 
(17);  adding  urea  raises  the  transition  temperature;  adding  ethylene  glycol  (22)  or  dimethyl  sulfoxide 
lowers  the  transition  temperature,  etc.  A  quantitative  understanding  of  how  solvent  composition  can  shift 
the  temperature  of  an  inverse  ten:^)erature  transition  would  improve  the  understanding  as  to  how  particular 
solutes  may  be  used  to  direct  proper  folding  of  proteins. 

Simplicity  of  the  Assay  System  and  efficiency  of  the  ATt  Mechanism  for  Chemomechanical 
Transduction:  For  both  thermomechanical  transduction  and  chemomechanical  transduction,  the 
demonstration  of  forces  developed  and  work  peifcxmed  can  be  by  means  of  the  simple  assay  of  watching  a 
strip  of  the  cross-linked  matrix  pick  up  a  weight  when  the  energy  source  is  thermal  for  thermomechanical 
transduction  (thermally  driven  folding),  or  pick  up  a  weight  when  the  energy  source  is  chemical,  for 
example,  as  when  the  salt  concentraticm  is  raised  or  when  the  concentration  of  proton  is  raised  to  achieve 
chemically  driven  folding  (16,17).  The  forces  are  obviously  real,  and  they  are  dramatic  with  an  efficiency, 
for  example,  in  the  conversion  of  chemical  energy  into  mechanical  work  that  is  more  than  an  order  of 
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magnitude  greater  than  previously  demonstrated  for  the  charge-charge  repulsion  mechanism  of 
chemomechanical  transduction  (13).  At  the  polypeptide  level,  the  iiKnease  in  tonperatuie  or  concentration, 
as  described,  drives  the  conversion  fix)m  the  unfolded  (hydrophobically  hydrated)  state  of  Figure  1 A  to  the 
contracted  (folded)  state  of  Figure  IB. 

Baromecharucal  Transduction:  It  has  recently  been  shown  that  a  change  in  pressure  can  shift  the 
transition  temperature  (23).  An  increase  in  pressiue  raises  the  temperature  of  the  transition  by  very 
significant  amounts  when  there  are  aromatic  residues  in  the  polymer.  The  molecular  interpretation  is  that 
waters  of  hydrophobic  hydration  surrounding  an  aromatic  residue  occupy  less  volume  than  bulk  water.  In 
terms  of  baromecharucal  transduction  in  a  water-filled  pressurizable  cylinder  with  a  weight  attached  to  a 
suspended  elastomeric  band  containing  aromatic  residues,  e.g.,  Phe  residues,  application  of  pressure 
would  cause  the  band  to  swell  and  lower  the  weight  whereas  releasing  the  pressure  would  cause  the  band 
to  contract  and  lift  the  weight.  This  has  now  been  observed.  Thus  pressure  could  be  used  to  unfold  a 
protein  containing  aromatic  residues  and  then  the  slow  release  of  pressure  could  be  used  to  allow  correct 
refolding  of  the  protein  and  this  could  be  done  simultaneous  with  changing  solute  concentration  by 
dialysis. 

Electromechanical  Transduction:  Etemcmstration  of  the  feasibility  of  electromechanical  transduction 
has  been  achieved  with  the  electrochemical  reduction  of  poly[3(GVGVP),(GK{NMcN}GVP)],  where 
(NMeN)  indicates  the  N-methyl  lucotinate  moiety  attached  by  amide  linkage  to  the  e-amino  of  lysine  (K). 
At  pH  9,  -1.6  volts  and  a  concentration  of  3.5  mg/ml,  the  electrochenucal  reduction  of  this  protein-based 
polymer  causes  a  decrease  in  the  value  Tt  from  47°C  to  36°C.  This  protein-based  polymer  has  also  been 
cross-linked  such  that  now  the  reduction  of  the  elastic  matrix,  with  the  use  of  a  mediatcH-,  will  be  attempted 
to  drive  contraction  (folding),  and  thereby  to  have  demcmstrated  electromechanical  transduction. 
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Thus  there  is  the  potential  for  all 

of  the  free  energy  transduction 

of  Figure  2  to  be  achieved  by 

controlling  folding  and 

assembly.  There  is  then 

demonstrated  potential  for  the 

four  transduction  coupled  to 

mechanical  force.  It  is  then  our 

proposal  that  any  free  energy 

input  -  pressure,  thermal, 

chemical  or  electrical  -  that  can 

drive  folding  to  give  the 

mechanical  component,  can  utilize  the  driving  force  for  folding  to  convert  energy  among  the  other  four  free 

energy  driving  forces.  This  is  controlling  higher  order  structure  to  achieve  function. 

Temperature  of  Inverse  Temperature  Transition,  Tf 
for  poly[/v(VPGVG),/x(VPGXG)] 

V 

0 

.1 

.2 
fx 

.3 

.4 

.5 

Figure  3;  Plot  of  the  temperature  of  the  inverse  temperature  transition,  Tt,  as  a  fur-iion  of  the  mole  fraction, /x,  of 
guest  residue  X.  The  order  of  the  residues  is  given  in  single  letter  code  for  the  closely  grouped  residues.  The 
order  and  values  of  Tp  for  the  residues  provides  an  experimental  hydrophobicity  scale.  (See  Table  I,  page  8.) 
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Temperature  of  Inverse  Temperature  Transition,  T^ 
for  poly[/v(VPGVG),/x(VPGXG)] 

(A  New  Hydrophobicity  Scale) 


Residue 

T,  at  /x  =  1 

Correlation 

X 

Coefficient 

Trp 

( W) 

_90«C 

0.993 

Tyr 

(Y) 

-SS^C 

0.999 

Phe 

(F) 

_30»C 

0.999 

Hi*  (pH  8) 

(HO) 

_io»c 

1.000 

Pro 

(P) 

(-8^*0) 

Lou 

(L) 

5»C 

0.999 

lie 

(1) 

lO^C 

0.999 

Met 

(M) 

20‘*C 

0.996 

Val 

(V) 

24’»C 

GIu(COOCH3)  (E*”) 

25®C 

1.000 

Glu(COOH) 

(EO) 

30*C 

1.000 

Cys 

(C) 

30*C 

1.000 

His  (pH  4) 

(H*) 

30OC 

1.000 

Ly#(NH2) 

(KO) 

350c 

0.936 

Asp(COOH) 

(DO) 

45^0 

0.994 

Ala 

(A) 

45*C 

0.997 

HyP 

50»C 

0.998 

Aon 

(N) 

50»C 

0.997 

Ser 

(S) 

50*C 

0.997 

Thr 

(T) 

50*C 

0.999 

Gly 

(G) 

55®C 

0.999 

Aro 

(R) 

60*C 

1.000 

Gin 

(Q) 

60*C 

0.999 

Lys(NH^) 

(K*) 

120'’C 

0.999 

Tyr(<>-0") 

(V) 

120°C 

0.996 

A8P(C00-) 

(O') 

170“C 

0.999 

Glu(COO~) 

(E') 

250*C 

1.000 

Ser(POj) 

lOOO^C 

1 .000 

8 


Sodium  Channel  Electrophorus  electricus 


251  W  M72  n»4  1400  iSOft 


tti  Subunit  of  DHP  Sensitive  Calcium  Channel 


Figure  4:  Mean  transition  temperature,  (Tt)ii.  plots  averaged  over  an  eleven  residue  sliding  window  for  bactcriorhodopsin 
(A),  halorhodopsin  (B),  glycophorin  (C),  cytochrome  b5  (D).  Na  Channel  Protein  (Electrophorus  electricus) 
(25)  (E)  and  Ca  Channel  Protein  (aj  subunit  of  DHP  sensitive)  (26)  (F).  Values  less  than  37°C  indicate 
sequences  that  would  be  hydrophobically  folded  or  interacted  as  in  the  case  of  a  transmembrane  helix.  In  each 
case  the  bar  indicates  a  length  of  23  residues,  the  approximate  number  of  residues  required  for  an  a-helix  to  span 
a  lipid  bilayer.  Note  for  each  membrane  protein  that  the  transmembrane  helices  are  (^served. 
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Hydrophobicity  Scale 


Understanding  the  contribution  of  each  chemical  moiety  of  a  protein  and  solute  to  the  temperature 
of  the  transition  make  it  possible  to  introduce  energy  in  one  form  to  shift  protein  folding  from  one  state  to 
another  while  directing  proper  folding.  As  a  majc»'  first  step  in  this  process,  a  new  type  of  hydrophobicity 
scale  has  been  experimentally  derived  (24)  which  is  the  relative  ccmtiibution  of  each  amino  acid  residue  to 
the  transition  temperature  of  an  inverse  temperature  transition  (see  Figure  3,  page  7  and  Table  I,  page  8). 
With  this  set  of  values  of  Tt  for  each  amino  acid  residue,  a  plot  of  a  mean  value  of  Tt,  i.e.  (Tt)  averaged 
over  a  sliding  window  of  a  given  number  of  residues  for  a  given  protein  sequence  can  evaluate  the  relative 
hydrophobicity  of  sequences.  In  particular  in  the  now  classical  test,  a  plot  of  (Tt)  versus  residue  number 
for  bacteriorhodopsin,  halorhodc^sin  glycophorin  A,  cytochrome  bs  and  the  sodium  and  calcium  channels 
is  capable  of  identifying  the  transmembrane  helices  (see  Figure  4,  page  9).  A  sequence  for  which  (Tt>  is 
less  than  37°C  would  be  hydrophobically  folded,  or  in  the  case  of  a  membrane  protein,  becomes  a 
potential  transmembrane  helix  (24). 

Enzymatic  Control  of  Folding  by  Phosphorylation 

The  most  dramatic  means  so  far  identified  for  shifting  the  temperature  of  the  inverse  temperature 
for  protein  folding  is  phosphorylation/dephosphorylation  (27).  Because  of  the  central  role  that 
phosphorylation  plays  in  modulating  protein  structure  and  function  (28),  an  elastomeric  polypentapeptide 
was  prepared  with  a  site  for  phosphorylation.  When  a  cyclic  AMP-dependent  protein  kinase  site  (29), 
RGYSLG,  is  introduced  into  the  polypentapeptide,  e.g.,  poly[3(XIPGVG),(RGYSLG)],  phosphorylation 
of  one-half  of  the  serine  residues  in  the  polypeptide  by  the  a  catalytic  subunit  of  the  protein  kinase  causes 
a  15°C  shift  in  the  transition  temperature  from  18°C  to  33®C.  For  an  intermediate  temperature  of  25°C,  for 
example,  the  phosphorylation  induced  shift  in  the  temperature  of  the  inverse  teiiq)erature  transition  would 
be  sufficient  to  be  essentially  fully  folded  before  phosphorylation  and  to  become  fully  unfolded  after 
phosphorylation  of  one-half  of  the  serine  residues  in  th  RGYSLG-containing  poly(IPGVG).  One 
phosphate  in  a  mean  molecular  weight  of  20,000  Da  is  sufficient  to  produce  such  a  dramatic  effect.  The 
interpretation  here  is  the  same  as  for  the  (XXT"  moiety.  The  phosphate  moieties  destructure  the  waters  of 
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hydrophobic  hydration  that  would  otherwise  provide  the  driving  force  for  folding  at  25°C.  ThCTefore,  the 
polypeptide  unfolds  on  phosphorylation.  Accordingly  should  a  phosphorylation  site  be  present  in  a 
microbially  produced  protein,  it  would  be  a  reasonable  consideration  to  phosphorylate  it  to  achieve  the 
proper  folding  fen*  the  phosphorylated  state,  with  care  taken  to  prevent  aggregational  potential,  and  then 
carefully  to  remove  the  phosphate  by  a  phosphatase  tho’eby  lowering  the  transition  temperature  to 
complete  the  refolding. 

It  is  now  becoming  apparent  that  if  biotechnology  is  to  more  fully  reach  its  potential  for  the 
production  of  proteins  by  their  expression  from  unnatural  cells  and  by  unnatural  processes,  it  will  be 
necessary  to  develop  rules  that  will  allow  the  protein  folding  required  f(X’  the  desired  property. 

2.  Technical  Objectives  of  Phase  I 

Elements  of  three  technical  objectives  are  being  addressed  in  Phase  I  which  are  central  to 
development  of  protein  folding  and  purification  strategies  and  to  the  design  of  protein  for  free  energy 
transduction: 

(1)  to  assess  the  magnitude  and  distance  dependence  of  the  repulsive  free  energy  of  interaction  that 
exists  between  carboxylate  moieties  and  hydrophobic  moieties  that  can  lead  to  large  shifts  in  the  pKa  (up  to 
several  pH  units,  i.e.,  a  target  of  3  pH  units)  of  the  carboxyl  moiety  and  the  raising  of  Tt,  the  temperature 
for  the  hydrophobic  folding  transition. 

(2)  to  evaluate  hydrophobic  domain  sizes,  both  as  to  intensity  of  the  hydrophobicity  and  the 
number  of  residues  in  the  domain,  that  can  give  rise  to  independent  folding  of  the  domains. 

and,  (3)  to  quantitate  the  effects  of  changes  in  solvent  composition  on  shifting  the  value  of  Tt,  for 
poly(VPGVG)  and  for  this  host  polypentapeptide  with  guest  residues  i.e.  for 
poly[/v(VPGVG)/x(VPGXG)]  where /v  and/x  are  mole  fractions  with  fy  +/x  =  1  and  where  X  is  the 
guest  residue. 


3.  Progress  in  Phase  I 


The  progress  achieved  in  Phase  I  can  be  discussed  in  terms  of  the  technical  objectives,  but  first  the 
syntheses  of  the  model  protein-based  polymers  will  be  enumerated. 

Peptide  Syntheses:  The  necessary  first  step  in  reaching  any  of  the  technical  objectives  is  the 
syntheses  of  those  protein-based  polymers  which  were  designed  to  provide  the  information  sought  in  the 
technical  objectives.  Extraordinary  progress  has  been  made  in  the  synthesis  component.  Instead  of  the 
proposed  ten  polypent^ptides  with  the  highest  complexity  being  polypentadecapeptides,  poly(15  mers), 
22  polypentapeptides  have  been  synthesized  and  another  6  are  under  polymerization  with  the  greatest 
complexity  being  the  much  more  difficult  polytricosapeptides,  poly(30  mers).  These  protein-based 
polymers  are  listed  in  Table  n,  page  13. 

a.  Hydrophobicity-Induced  pKg  Shifts 

(1)  Successful  Design  at  Nanometric  Dimensions  to  Enhance  Hydrophobicity-Induced 
pKa  Shifts. 

At  the  writing  of  *’  is  Phase  I  proposal,  a  one  pH  unit  hydrophobicity-induced  pl'a 
had  been  observed.  The  target  was  to  achieve  as  much  as  a  3  pH  unit  shift.  Our  objective  has  been 
exceeded.  What  has  actually  been  achieved  is  a  3.8  pH  unit  shift  This  is  described  with  a  succinct  focus 
in  what  follows. 

With  poly[4(GVGVP),(GEGVP)]  (I)  as  a  reference  state,  the  polytricosapeptides, 
poIy[3(GVGVP),2(GFGFP),(GEGFP)]  (H);  poIy[2(GVGVP),2(GVGFP),(GFGFP),  (GEGFP)]  (HI); 
poly(GEGFP  GVGVP  GVGVP  GVGVP  GFGFP  GFGFP)  (IV);  and  poly(GEGFP  GVGVP  GVGFP- 
GFGFP  GVGVP  GVGFP)  (V),  all  having  the  same  theoretical  and  essentially  the  same  analytical  amino 
acid  compositions,  were  synthesized  and  designed  fen*  the  specific  purpose  of  testing  the  limits  of 
hydrophobicity-induced  pKg  shifts  in  this  system.  Polymers  I,  n,  m,  IV  and  V  exhibited  pKg  values  for 
the  Glu(E)  residues  of  4.3,  4.7,  6.3,  7.7  and  8.1,  respectively.  Due  to  the  limited  number  of  Glu 
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TABLE  n 


Polymers  Synthesized 


Polymers 


37®C 


20®C 


CG56PF  poly(GVGVP 


CG58PF  poIy[(GEGFP)2(GVGVP)2(GVGFP)GFGFP 


CG59PF  poIy[(GEGFP)3(GVGVP)2(GFGFP 


CG61PF  doMGEGFP  GVGVP  GVGFP  GFGFP  GVGVP  GVGFP^ 


CG62PF  polvrGEGFP  GVGVP  GVGVP  GVGVP  GFGFP  GFGFP^ 


CG70PF  polyfGDGIP 


CG71PF 


CG72PF 


CG74PF 


CG75PF 


CG76PF 


8(GDGIP)0.2(GVGIP 


<(GDGIP)0.4(GVGIP 


4(GDGIP)0.6(GVGIP 


3(GDGIP)0.7rGVGIP 


2(GDGIP)0.8(GVGIP 


1(GDGIP)0.9(GVGIP 


05(GDGIP)0.95(GVGIP 


CG78PF  poly(GVGIP 


CG79PF  polyCGKGIP 


CG80PF 


CG81PF 


.8(GKGIP)0.2(GVGIP 


CG85PF 


CG86PF 


6(GKGIP)0.4(GVGIP 


4(GKGIP)0.6(GyGIP 


3(GKGIP)0.7(GVGIP 


2(GKGIP)0.8(GVGIP 


1(GKGIP)0.9(GVGIP 


05(GKGIP)0.95(GVGIP 


CG87PF  poly[(GEGIP)2(GVGVP)2GVGIP)(GIGIP 


CG88PF  polyr(GEGIP)3{GVGVP)2(GIGIP 


CG89PF  poly[(GDGIP)2(GVGVP)2GVGIP)(GIGIP 


CG90PF  poIyr(GDGIP)3(GVGVP)2(GIGIP 


CG91PF  poIyf(GKGIP)2(GVGVP)2GVGlP)(GIGIP 


CG92PF  poly[(GKGIP)3(GVGVP)2(GIGIP 


4.90 


5.18 


5.40 


4.80 


7.60 


residues,  the  structures  themselves  exclude  the  often  invoked  charge-charge  interaction  mechanism  for  pKa 
shifts.  The  differences  in  pKa  values  arise  from  the  differences  in  proximity  of  the  Glu(E)  and  Phe(F) 
residues  within  and  between  pentamers  with  optimal  arranging  of  five  nearest-neighbor  Phe  residues  at 
nanometer  distances  from  the  Glu  residue  in  FV  and  V.  This  remarkable  increase  of  3.4  pH  units  within 
polymers  n  through  V  of  essentially  the  same  amino  acid  composition  and  of  3.8  pH  units  between 
polymers  I  and  V  within  a  system  which  is  approximately  50%  water  by  weight  in  its  folded  state 
demonstrates  a  repulsive  free  energy  of  interaction  between  hydration  processes  of  the  Phe  and  Glu  side 
chains  of  4.6  kcal/mole  within  the  polytricosapeptide  group  and  of  S.l  kcal/mole  overall.  To  our 
knowledge  these  are  the  largest  pKa  shifts  documented  for  the  Glu  residue,  and  the  recently  identifred 
interactions  responsible  are  considered  to  be  important  free  energies  of  interaction  in  modulating  protein 
stracture  and  function  (13). 


Controlling  pKa  by  Design  at  Nanometric  Dimensions 

A  B  C  D 


Polymer  I: 

Polyl0,8(GVGVP),0.2(GEGVP)l 


Polymer  IV: 

PoWGEGFPGVGVPGVGVP- 

GVGVPGFGFPGFGFP) 


Polymer  V: 

Poly(GEGFPGVGVPGVGFP- 

GFGFPGVGVPGVGFP) 


I  }  Phe  (F) 

O  Glu  (E) 

Figure  5  Schematic  representation  of  the  ^spiral  structure  proposed  fw  the  polypentapcptide,  polyfVal^-Pro^-Gly^- 

Val'^-Gly^),  which  is  equivalent  to  poly(GVGVP).  The  Pro^-Gly^  P-tum  is  seen  to  function  as  a  spaco* 
between  the  turns  of  the  P-spiral  in  A.  and  the  hydrophobic  folding  arises  principally  from  intertum 
Vall'yCH3  '  Pro^  P-CH2  mteractives.  In  B.  there  are  occasional  Glu(O)  residues  randomly  dispersed.  In  C. 
the  Phe  residues  are  placed  in  proximity  to  the  Glu  residues  based  on  primary  structure.  In  D.  the  Phe 
residues  are  placed  in  proximity  to  the  Glu  residues  based  on  a  P-spiral  with  apixoximately  3  pentamers  per 
turn,  i.e.,  based  on  tertiary  structure. 
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Polymers  IV  and  V  were  designed  on  the  basis  of  the  working  ^-spiral  structure  of 
poly(GVGVP)  shown  schematically  in  Figure  5A  (13,16).  For  the  P-spiral  structure,  the  distance 
between  turns  is  approximately  one  nanometer  (30,31).  This  results  in  the  nanometer  proximities  of  the 
Glu  and  Phe  residues  shown  in  Figure  5C  and  D.  The  polytricosapeptides  were  constructed  by  the 
sequential  addititm  of  pentamers.  The  composite  pentamers  were  also  mixed  in  the  appropriate  ratios  and 
polymerized  to  obtain  polymers  n  and  HI  with  a  more  random  ordering  of  pentamers.  A  complete 
description  of  the  syntheses  will  be  presented  elsewhere.  The  structtu^s  were  v«ified  by  carbon- 13 
nuclear  magnetic  resonance  and  amino  acid  analyses.  The  amino  acid  analyses  for  polymers  I  through  V 
are  given  in  Table  m  (page  16)  for  the  average  values  and  standard  deviations  of  four  runs  on  each 
sample. 
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Figure  6  An  acid-base  titration  curve  for  polymer  V  and  the  resolved  curve  included  as  an  inset  The 

starting  concentratitm  was  40  mg/ni^  and  the  temperature  was  20®C. 


Acid-Base  Titration  Data  and  Resolved  Curve 
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The  raw  add-base  titration  data  and  the  resolved  curve  for  one  of  the  three  titrations 
for  polymer  V  are  given  in  Figure  6  (page  15)  and  the  averaged  value  and  standard  deviations  for  three 
runs  each  for  polymers  n  through  V  are  included  in  Table  HI  below.  Curve  resolution  to  obtain  a,  the 
degree  of  ionization,  and  the  pKa  value  utilized  the  second  derivative  of  the  experimental  titration  curve. 
The  negative  extremum  for  the  second  derivative  signals  die  start  of  the  ionization  process,  and  the  positive 
extremum  signals  the  completion  of  the  ionization  process.  The  midpoint  between  the  two  extrema  is  the 
pKa  value  (32,33).  Resolved  acid-base  titration  curves  for  polymers  I,  H,  m,  IV  and  V  are  given  in 
Figure  7  (page  17). 


Table  m 

Amino  add  compositon,  experimental  pKa  values  and  calculated  mean  hydrophobidties,  <  Tj ),  for  polymers  I  through  V 


Amino  Acid 
or  Property 

Polymer 

Found 

Theoretical 

II 

Fourxl 

■9 

IV 

Found 

V 

Found 

II- V 

Theoretical 

Glutamic  Add  (E) 

0.20 

0.20 

0.17  ±  0.01 

0.15  ±  0.01 

0.16  ±  0.01 

0.14  ±  0.01 

0.17 

Glycine  (G) 

1.99 

2.00 

2.13  ±  0.03 

1.89  ±  0.03 

2.05  ±  0.01 

1.98  ±  0.02 

2.00 

Proline  (P) 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

Valine  (V) 

1.79 

1.80 

1.03  ±  0.16 

1.10  ±  0.10 

0.99  ±  0.04 

1.00 

Phenylalanine  (F) 

1 

1 

1 

1 

1 

1 

0.74  ±  0.06 

0.90  ±  0.01 

0.79  ±  0.06 

0.84  ±  0.01 

0.83 

Experimental  pKa 

4.3  ± 

0.02 

4.7  ±  0.05 

6.3  ±  0.05 

7.7  ±  0.1 

8.1  ±  0.1 

E® 

Calculated  <  Tf  > 

30" 

23" 

e 

O 

22" 

21" 

E* 

39" 

30" 

e 

(D 

CM 

29" 

27" 

In  the  amino  add  analyses,  Pro  is  taxen  as  1.0. 
E®  stands  for  Glu  (COOH)  and  E"  for  Glu(COO‘). 
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Structure  Dependent  Hydrophobicity  Induced  pKa  Shifts 


Figure  7  Resolved  acid-base  titration  curves  for  polymers  I  through  V.  A  remarkable  overall  pKa 
shift  of  3.8  pH  units  is  observed.  For  all  curves  the  starting  concentration  was  40  m^ml 
and  the  temperature  was  20°C. 

Poly(GVGVP)  and  analogs  thereof  undergo  phase  transitions,  the  temperatures  for 
which  vary  systematically  with  composition  (13).  The  general  compositions  can  be  written 
poly(/v(GVGVP),/x(GXGVP)]  where  fy  and  /x  are  mole  fractions  with  /v  +  /x  =  1  ^d  where  X  can 
be  any  amino  acid  residue  or  chemical  modification  thereof.  These  polypentapeptides  are  soluble  in  water 
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at  low  temperatures  but  on  raising  the  temperature  aggregation  and  settling  occurs  to  form  a  more  dense 
viscoelastic  phase,  called  a  coacervate,  which  is  approximately  50%  water  by  weight.  The  onset 
temperature  for  this  transition  is  indicated  as  Tt.  Changes  in  composition  change  the  value  of  Tt  in  a 
systematic  way  such  that  plots  of  Tt  vs.  fx  are  linear  (See  Figure  3,  page  7)  and  such  that  more 
hydrophobic  residues  lower  the  value  of  Tt  and  less  hydrophobic  residues  than  Val  and  more  polar 
residues  increase  the  value  of  Tt.  This  property  has  been  used  to  develqp  a  hydrophobicity  scale  (24)  (See 
Table  I)  in  which  the  reference  values  for  Tt  are  obtained  by  extrapolating  to  /x  =  1-  The  values  of  Tt 
range  from  -90°C  for  poly(GWGVP)  to  250°C  fw  poly(GEGVP)  with  Glu(COO“). 

The  mean  residue  hydrophobicities,  (Tt) ,  calculated  using  this  hydrt^hobicity  scale 
and  the  amino  acid  analysis  data  are  included  in  Table  m,  page  16,  for  polymers  I  through  V  where  the 
values  of  (Tt)  are  seen  to  be  nearly  the  same  for  polymers  n  through  V  yet  the  pKa  values  differ 
remarkably.  Previous  studies  have  shown  that  increased  hydrophobicity  increases  pKa  (13,20,34).  The 
present  study  was  designed  to  determine  the  effect  of  difforing  proximity  of  Glu  and  Phe  residues  on  pKa. 
Significant  differences  in  pKa  values  were  also  observed  between  polymers  IV  and  V  where  the  sequences 
are  fixed  and  where  variable  inccHporation  of  different  pentamers  during  polymerization  would  not  be  a 
factor. 


The  dominance  in  this  nmlecular  system  of  the  tertiary  structure  over  the  primary 
structure  in  achieving  the  largest  pKa  shift  can  be  exen^rlified  by  mean  residue  hydrophobicity  plots  fca*  an 
internal  six  repeats  (180  residues)  using  a  sliding  window  of  eleven  residues  for  the  average  value,  i.e., 
(Tt)ii  for  polymers  FV  and  V  (24).  These  plots  are  given  in  Figure  8  where  the  smaller  number  indicates 
the  greater  hydrophobicity.  The  value  of  (Tt)ii  for  the  Glu  COOH  state  is  15  for  polymer  IV  and  20  for 
V,  and  the  value  for  the  Glu  COO~  state  is  35  fw  IV  and  40  for  V.  Thus,  based  on  primary  structures, 
the  Glu  residues  in  Polymer  IV  would  experience  the  greater  hydrophobicity  and  would  be  expected  to 
give  the  larger  pKg  shift  The  pKa  shift  is  greater,  however,  for  polymei  V.  Oily  when  the  proper 
folding  is  taken  into  account  does  the  spatial  proximity  bectmie  apparent  and  this  Glu-Phe  proximity 
resulting  from  the  p-spiral  folding,  as  shown  in  Figure  5C  and  D,  provides  the  understanding  for  the 
larger  pK^  shift  being  exhibited  by  Polymer  V. 
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Mean  Residue  Hydrophobicity  Plots  for  the  Poly(tricosapeptides),  IV  and  V 
Sliding  Window  of  11  Residue  Average, 

Polymer  IV  Polymer  V 


•  Value  of  ( T(  for  the  Glu  residue  for  each  polymer  for  both  the  Glu  COOH  and  Glu  COCT  states 
- Mean  value  of  <  )  based  on  the  amino  acid  analysis  data 


Figure  8  Mean  residue  hydrophobicity  plots  with  an  eleven  residue  sliding  \^dow  plotted  at  the 
central  residue  of  count  for  polymers  IV  and  V .  The  small  dot  positions  the  recur^g  Glu 
residues  and  the  dashed  lines  are  the  mean  values  of  Tt  calculated  from  the  amino  acid 
analysis  data.  The  upward  deflections  (smaller  numbers)  signify  greater  hydrophobicity. 
Note  that  the  Glu  residues  for  polymer  FV  based  on  primary  stracture  are  in  the  greater 
hydrophobic  environment  than  for  polymer  V,  yet  polymer  V  exhibits  the  greater  pKa  shift. 
This  is  considered  to  be  due  to  the  tertiary  structure  as  seen  in  Figure  ID. 
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Rxxn  the  piimaiy  structures  themselves,  it  is  clear  that  the  differences  in  pKa  do  not 
arise  from  the  commonly  considered  electrostatic  interactions  (35,36).  The  pKa  shifts  instead  are  an 
expression  of  a  repulsive  free  energy  of  interaction  which  exists  between  the  hydration  shells  of 
hydrcq)hobic  (apolar)  and  polar(CXXr^  moieties  as  discussed  in  more  detail  elsewhere  (13).  The  change 
in  Gibbs  free  energy  per  mole  obtained  from  the  change  in  chemical  potential,  Ap.,  required  to  maintain 
50%  ionization  of  the  side  chain  is  given  by  Ap  =  -2.3  RT  ApKa.  At  20°C  with  a  ApKa  of  3.8  pH  units, 
Ap  is  5  kcal/mole,  that  is,  there  is  observed  a  repulsive  Gibbs  free  energy  of  interaction  arising  out  of  the 
proximity  of  Phe  and  Glu  side  chains  in  polymer  V. 

It  is  not  the  Glu  side  chain  becon^g  folded  in  the  hydrophobic,  water-free  interior 
of  a  globular  protein  that  provides  an  adequate  description  of  the  pKa  shift;  rather  it  is  the  competition 
between  Glu  and  Phe  side  chains  for  appropriately  stractured  hydration  shells  in  a  filamentous  folded  state 
having  approximately  50%  water  by  weight  that  is  responsible  for  the  unfavorable  free  energy  for 
carboxylate  anion  formation  (13).  This  is  referred  to  as  an  apolar-polar  repulsive  free  energy  of  hydradtxi. 
This  mechanism  can  be  used  to  drive  chemomechanical  transduction  as  the  polypeptide  folding  into  a 
^-spiral  can  be  used  to  lift  a  weight  in  elastomeric  bands  obtained  on  y-iiradiation  of  the  coacervate  state 
(13,16,17).  This  mechanism  for  driving  folding  and  unfolding  is  an  order  of  magnitude  more  efficient  in 
the  amount  of  chemical  energy  (due  to  an  increase  in  proton  concentration)  required  to  lift  a  given  weight 
than  is  the  charge-charge  repulsion  mechanism  using  the  same  COOH/CC)Cr~  chemical  couple  (13,21), 
and  0\i/Bf)a,  where  /  is  force,  is  negative  for  the  new  mechanism  (13,35)  and  positive  fn:  the  latter  (21). 

In  this  case  the  formation  of  a  charged  species  is  viewed  as  destroying  the 
structured  water  surrounding  hydrophobic  moieties  when  exposed  to  water  such  that  the  thermodynamic 
driving  force  for  hydrophobic  folding  is  lost  and  unfolding  can  occiu:.  The  phosphate  moiety  attached  to  a 
serine  residue  has  been  ^wn  to  be  the  most  potent  chemical  moiety  in  destroying  stractured  hydrophobic 
hydration,  that  is,  phosphtnylation  is  the  nmst  effective  moiety  on  a  molar  basis  for  driving  unfolding. 
One  pho^hare  moiety  in  300  residues  in  poly[3(KIPGVG),(RGYSLG)]  (VI)  can  raise  the  ten^rature  of 
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the  folding  transition  by  14®C  from  18®C  to  32®C  (13,27).  Thus  phosphorylation  of  the  Ser  residue  of 
polymer  VI  to  the  extent  of  one  phosphate  per  3(X)  residues,  at  25®C,  can  result  in  the  unfolding  of  the 
polymer.  This  gives  for  the  hydrophobicity  scale  a  value  of  Tt  for  Ser-phosphate  of  over  1,{X)0®C. 
Because  of  the  ubiquitous  use  of  phosphorylation  to  drive  protein  function,  because  of  the  dramatic  effect 
of  phosphorylation  in  the  mechanism  demonstrated  by  this  system  and  because  of  the  efficiency  of  this 
mechanism  in  driving  polypeptide  folding  to  perform  chemomechanical  transduction,  it  is  believed  that  the 
mechanism  demonstrated  here  in  terms  of  dramatic  pKa  shifts  would  be  a  dominant  mechanism  in  protein 
structure  and  function. 

(2)  The  Relative  Sensitivity  of  Amino  Acid  Side  Chains  to  Hydrophobicity-Induced 
pKa  Shifts. 

Comparison  of  the  relative  sensitivity  of  ionizable  amino  acid  side  chains  to 
hydrc^hobicity-induced  pK^  shifts  should  be  achieved  under  conditions  wherein  the  charge-induced  pK^ 
shifts  can  be  shown  to  be  negligible. 

As  the  first  comparison  Glu(E)  and  Asp(D),  both  of  which  have  the  carboxyl  side 
chains,  are  studied  in  the  protein-based  polymers,  poly[/v(GVGIP),/E(GEGIP)]  and 
poly[/v(GVGIP)/D(GDGIP)]  where  the  mole  fractions, /e  and /d  are  varied  between  one  and  0.05,  and 
the  acid-base  titrations  are  carried  out  in  normal  saline  at  37®C.  Plots  of  fe.  and/o  versus  pKa  are  given  in 
Figure  9.  In  both  cases  there  is  no  significant  pKa  change  in  the  1.0  to  0.8  mole  firaction  range.  At  lower 
mole  fractions,  that  is,  as  the  carboxyl  containing  side  chain  is  further  replaced  by  an  isq)r(q)yl  side  chain, 
the  pKa  systematically  increases  up  to  values  {q)proaching  7.0. 
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Figure  9 

It  is  clear  from  the  data  in  Figure  9  that  the  observed  pKa  shifts  are  due  exclusively 
to  hydrophobicity  increases.  This  is  because  charge-induced  pKa  shifts  would  be  seen  as  increases  in 
pKa  as  the  mole  fractions /e  and/o  increased  rather  than  decreased.  A  situation  where  both  charge- 
induced  and  hydrophobicity-induced  shifts  were  observed  would  have  curves  which  were  parabolic  with 
regard  to  the  mole  fraction  axis.  Interestingly,  both  the  pKa  for  Glu  and  that  for  Asp  approach  7  as  the 
mole  fraction  drops  below  0.1.  As  the  pKa  for  Asp  begins  at  about  3.8  for/o  =  1  whereas  that  for  Glu  is 
near  4.3,  the  greater  pKa  shift  appears  to  be  exhibited  by  Asp.  This  will  be  tested  by  addidonal  data  points 
for  Asp  in  the  0. 14  to  0.08  mole  fraction  range  and  in  the  polytricosamer  system  of  Figure  5D  where  Glu 
is  replaced  by  Asp.  This  is  planned  in  Phase  II. 

b .  Hydrophobic  Domain  Size  and  Intensity  Studies 

When  poly(GVGVP)  and  poly(GVGIP)  are  mixed  at  low  temperatures,  below  lO^C,  and 
the  temperature  of  the  mixture  is  raised,  the  latter  begins  separating  out  (coacervating)  at  1 1°C,  whereas  the 
former  does  so  at  28®C.  The  molecules  self  separate  with  two  transitions  based  on  their  hydrophobicities. 
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When  the  random  copolypentapeptide,  poly[0.5(GVGVP),0.5(GVGIP)],  is  studied,  it  separates  out 
(coacervates)  at  an  intermediate  temperature  of  19°C  as  a  single  transition.  The  objective  is  to  find,  for  a 
set  of  protein-based  polymers  (GVGVP)i-(GVGIP)fn.  the  lower  limit  of  1  and  m,  for  which  the  two 
domains  will  exhibit  separate  transitions.  To  date,  low  enough  molecular  weights  have  yet  to  be  obtained. 
Presently  underway  are  polymerizations  fen*  4, 8  and  16  hours  which  are  to  be  fractionated  by  dialysis  and 
coupled. 


In  addressing  the  issue  of  hierarchical  hydrophobic  folding  in  proteins,  the  pentamers 
required  to  synthesize  the  polymers,  that  is,  Boc-GVGVP-OH,  Boc-GVGIP-OH  and  Boc-GFGVP-OH, 
have  been  synthesized.  A  screening  effcHt  was  undertaken  to  obtain  polymers  of  lower  molecular  weight, 
i.e.,  less  than  50  kDa.  The  efforts  are: 

Boc-GVGVP-ONp  was  deblocked  with  TFA  and  polymerized  fco’  48  hours  instead  of  our 
usual  14  days  (to  obtain  greater  than  50  kDa)  and  dialyzed  against  50,000  molecular  weight  cut-off 
dialysis  tubing  and  lyophilized.  Surprisingly,  the  yield  in  terms  of  the  quantity  of  polymer  retained  in 
50,000  molecular  weight  cut-off  tubing  after  extensive  dialysis  against  water  was  in  the  75  -  80%  range 
with  a  TPe  of  25°C.  This  means  that  high  quality  high  molecular  weight  (GVGVP)]  was  obtained  with  1  > 
120  in  just  48  hours  of  pdymerization. 

Boc-GVGVP-OH  was  deblocked  and  polymerized  for  24  hours  using  EDCI  as  the 
polymerizing  agent  in  the  presence  of  HOBt  The  polymer  was  dissolved  in  water  and  dialyzed  against 
50,000  molecular  weight  cut-off  dialysis  tubing  and  lyophilized.  It  was  found  that  the  yield  of  the 
polymer  having  greater  than  50,000  molecular  weight  was  about  75%.  It  has  also  been  found  in  our 
previous  studies  that  EE)CI  alone  does  not  produce  polymer  having  molecular  weight  greater  than  3500. 

Boc-GVGVP-OH  was  deblocked  and  polymerized  for  4  hours,  8  hours,  and  16  hours 
sepaiately  using  EDO  and  HOBt  The  50,000  dialysis  of  these  polymers  is  in  progress.  By  using  the 
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shorter  times,  it  is  hoped  that  lower  molecular  weights  will  be  obtained  which  could  then  be  coupled  to 
(GVGIP)in  polymers  similarly  prepared.  An  additional  approach  is  included  in  the  Phase  n  proposal. 

c.  Effect  of  Protein  Naturation  and  Purification  Solutes  on  Hydrophobic  Higher  Order 

Stmcturing. 

There  are  many  solutes  which  are  routinely  used  in  protein  renaturation/denaturation  and 
purification  efforts  for  which  a  better  understanding  of  function  is  required.  Our  noodel  protein-based 
polymers  allow  identification  and  even  quantitation  of  their  relative  effects  on  hydrophobic  folding  and 
assembly.  The  solutes  studied  in  Phase  I  are  urea,  guanidine  HCl,  sodium  dodecyl  sulfate  (SDS),  Triton 
X-100,  ammonium  sulfate,  sodium  chloride,  trifluorocthanol  (TFE),  trifluoroacetic  acid  (TFA),  and 
glycerol.  All  of  these  solutes  have  been  used  in  protein  purification,  renaturation/denaturation  and 
structure  studies.  The  results  are  as  follows;  In  order  of  potency  in  raising  the  temperature,  Tt,  of  the 
inverse  temperature  transition  of  40  mg/ml  of  poly(GVGVP)  are  SDS  (600®C/mole,  measured  at  0.1  M), 
guanidine  HQ  (120°C/mole  at  0.3  M),  urea  (55°amole  at  0.3M),  and  Triton  X-100  (3(fC  at  0.3  M).  This 
is,  therefore,  the  relative  effectiveness  in  disrupting  hydrophobic  folding  and  assembly.  These  data  are 
plotted  in  Figure  lOA  (Data  for  triton  X1(X)  ends  at  0.3  M  since  at  higher  ctMicentrations  it  separates  out  of 
solution.).  In  CHxler  of  potency  in  enhancing  hydrophobic  association,  that  is,  in  lowering  the  value  of  Tt 
for  40  mg/ml  of  poly(GVGVP)  are  ammonium  sulfate  (-80°CVnx>le  measured  at  0.2  M),  sodium  chloride 
(-14°C/moie  at  0.4  M),  trifluoroethanol  (-ll^C/mole  atO.4  M),  trifluoroacetic  acid  (-9°C7molc  at  0.4 
M),  and  glycerol  (-2.2®Cymole  at  0.4  M).  The  data  are  given  in  Figure  lOB.  Also  included  in  Rgiue  10 
are  data  for  dimethylsulfoxide  (DMSO)  and  ethylene  glycol  which  are  plotted  with  respect  to  the  top  scale 
in  the  figiue  from  0  M  to  S  M.  It  is  now  becoming  possible,  therefore,  to  plan  a  strategy  based  on  primary 
structure  to  klentify  regions  in  a  protein  and  use  the  above  studied  solutes  to  fold  and  unfold  or  assemble 
and  disassemble  hydrophobic  associations.  What  is  required  now  is  a  determination  of  the  relative 
effectiveness  of  these  as  a  function  of  degree  of  hydrophobicity,  that  is,  it  is  now  necessary  to  carry  out 
these  studies  on  protein-based  polymers  oi  diffning  mean  hydrophobicities  as  measured  by  the  scale 
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of  Table  I.  Studies  to  date  using  sodium  chloride  indicate  that  the  slope  of  — 14°C/mole  is  the  same  for 
different  hydiophobicities.  This  will  need  to  be  determined  for  the  other  solutes. 


O  .2  .4-  .S  .S  1 


molarity 

Figure  10 
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4. 


Technical  Feasibility  Demonstrated  in  Phase  I 


Two  fundamental  contributions  have  been  made  in  Phase  1.  The  target  of  a  Glu  pKa  shift  of  up  to 
3  pH  imits  was  exceeded  and  a  series  of  solutes  within  the  new  ATt  paradigm  can  be  readily  defined  as  to 
whether  and  with  what  efficacy  they  enhaitce  or  disrupt  hydrophobic  folding  and  assembly  in  proteins. 

The  observed  pKa  shift  for  Glu  from  4.3  to  8.1,  to  our  knowledge,  is  the  largest  shift  ever 
documented  for  a  Glu  residue,  and  it  was  achieved  by  design  using  polytricos^ptides  in  which  5  apolar 
Phe(F)  side  chains  were  at  approximately  1  nm  distances  fixun  the  polar  Glu  (CXXDH)  side  chain  (See 
Figtire  5).  This  pKa  shift  represents  a  free  energy  of  interaction  of  over  5  kcal/mole,  as  calculated  from  the 
expression  for  the  change  in  chemical  potential,  Ap,  required  to  maintain  the  Glu  side  chain  at  0.5  degree 
ionization,  i.e.,  Ap,  =  2.3  RT  ApKa  with  R  =  1.987  cal/noole  degree  and  T  in  ®K.  These  are  referred  to  as 
apolar-polar  repulsive  free  energies  of  hydration,  and  they  are  clearly  the  magnitudes  of  interaction 
energies  that  control  protein  folding  and  drive  protein  function. 

The  data  of  Figures  10  A  and  B  show  the  relative  effects  of  solutes  in  disrupting  (raising  the  value 
of  Tt)  or  in  enhancing  (lowering  the  value  of  TJ  hydrophobic  folding  and  assembly,  which  is  basic  to 
higher  order  structuring  in  proteins.  AccOTclingly,  the  coarse  outline  of  an  approach  for  achieving  properly 
folded  and  assembled  protein  begins  to  emerge.  One  begins  with  the  (Tt)ii  profile  for  the  protein  of 
interest,  examples  of  which  are  given  in  Figure  4.  These  identify  the  hydrophobic  domains.  Next  with 
the  data  of  Figures  10  A  and  B  the  hydrophobic  associations  within  and  between  the  domains  can  be 
controlled  by  addition  or  removal  of  a  particular  solute  and  while  favoring  or  disfavoring  secondary 
structure  formation.  For  example,  consider  TFE  and  TFA,  both  of  which  enhance  hydrophobic 
association.  TFE  enhances  secondary  structure,  particularly  a-helix  formation,  while  TFA  disrupts 
secondary  structure.  This  means  that  it  is  possible,  depending  on  the  folding  regimen  to  be  tried,  to 
enhance  or  to  disrupt  secondary  structure  while  favoring  hydrophobic  association. 
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When  the  variable  of  solute  is  combined  with  effective  use  of  the  degree  of  ionization  of  particular 
functional  side  chains  within  or  proximal  to  the  hydrophobic  domains,  consid^able  control  over  the  higher 
order  structuring  of  protein  is  obtained.  What  is  now  required  are  the  determination  of  the  hydrc^hobicity- 
induced  pKa  shifts  of  the  rest  of  the  amino  acid  residues,  the  detmnination  of  hydrophobic  domain  size 
and  intensity  for  independence  of  folding,  the  determination  of  combinatorial  effects  of  organic  and 
inorganic  solutes  with  varied  hydrophobicity,  and  the  determination  of  the  effect  of  hydrophobicity  on 
enhancing  charge-charge  interactions.  These  are  plaimed  in  the  Phase  n  effcat. 

Therefore  in  Phase  I,  an  extraordinary  candidate  system  has  been  identified  and  partially 
characterized  for  detailed  study  and  design  of  appropriate  methodology  for  verification  of  rules  governing 
higher  order  (tertiary  and  quaternary)  structure  fcxmation. 
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